Text2LIVE: Text-Driven Layered Image and Video Editing

نویسندگان

چکیده

We present a method for zero-shot, text-driven editing of natural images and videos. Given an image or video text prompt, our goal is to edit the appearance existing objects (e.g., texture) augment scene with visual effects smoke, fire) in semantic manner. train generator on internal dataset, extracted from single input, while leveraging external pretrained CLIP model impose losses. Rather than directly generating edited output, key idea generate layer (color+opacity) that composited over input. This allows us control generation maintain high fidelity input via novel losses applied layer. Our neither relies nor requires user-provided masks. demonstrate localized, edits high-resolution videos across variety scenes. Webpage: http://www.text2live.github.io .

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Text-image Coupling for Editing Literary Sources

Users need more sophisticated tools to handle the growing number of image-based documents available in databases. In this paper, we present a system devoted to the editing and browsing of complex literary hypermedia including original manuscript documents and other handwritten sources. Editing capabilities allow the user to transcribe manuscript images in an interactive way and to encode the re...

متن کامل

Editing out Video Editing

paradigm shift in media production: the advent of computational media production that will automate the capture, editing, and reuse of video content. By integrating metadata creation and (re)use throughout the media production process, we’ll enable the mass customization of video. F or the majority of people to not just watch but make video on a daily basis, the current media production process...

متن کامل

Bayesian Scheme for Interactive Colourization, Recolourization and Image/Video Editing

We propose a general image and video editing method based on a Bayesian segmentation framework. In the first stage, classes are established from scribbles made by a user on the image. These scribbles can be considered as a multimap (multilabel map) that defines the boundary conditions of a probability measure field to be computed for each pixel. In the second stage, the global minima of a posit...

متن کامل

Design Issues for Line-Driven Text Editing / Annotation Systems

Recent research on interfaces driven by line-markings indicates that there are many potential benefits and applications of such interfaces. Benefits include the exploitation of users' handwriting skills and their skills in understanding handwritten marks. There are systems that have exploited one or the other of these benefits but not both. One application which would take advantage of both of ...

متن کامل

Advanced editing methods for image and video sequences

In the context of image and video editing, this thesis proposes methods for modifying the semantic content of a recorded scene. Two different editing problems are approached: First, the removal of ghosting artifacts from high dynamic range (HDR) images recovered from exposure sequences, and second, the removal of objects from video sequences recorded with and without camera motion. These editin...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2022

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-031-19784-0_41